Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 105395 |
| Missing cells | 827337 |
| Missing cells (%) | 35.7% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 24.5 MiB |
| Average record size in memory | 244.0 B |
Variable types
| NUM | 12 |
|---|---|
| UNSUPPORTED | 7 |
| CAT | 2 |
| BOOL | 1 |
operation_car has constant value "105395" | Constant |
operation_date has a high cardinality: 17065 distinct values | High cardinality |
index_train has 105395 (100.0%) missing values | Missing |
danger has 88907 (84.4%) missing values | Missing |
loaded has 105395 (100.0%) missing values | Missing |
operation_train has 105395 (100.0%) missing values | Missing |
rod_train has 105395 (100.0%) missing values | Missing |
ssp_station_esr has 105395 (100.0%) missing values | Missing |
ssp_station_id has 105395 (100.0%) missing values | Missing |
weight_brutto has 105395 (100.0%) missing values | Missing |
adm is highly skewed (γ1 = 40.75622229) | Skewed |
df_index has unique values | Unique |
index_train is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
loaded is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
operation_train is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
rod_train is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
ssp_station_esr is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
ssp_station_id is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
weight_brutto is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
receiver has 16781 (15.9%) zeros | Zeros |
Reproduction
| Analysis started | 2021-04-14 20:03:11.442945 |
|---|---|
| Analysis finished | 2021-04-14 20:03:47.663612 |
| Duration | 36.22 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 105395 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2103808.687 |
|---|---|
| Minimum | 8 |
| Maximum | 4189794 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 823.5 KiB |
Quantile statistics
| Minimum | 8 |
|---|---|
| 5-th percentile | 195002.9 |
| Q1 | 1063153 |
| median | 2046369 |
| Q3 | 3178158.5 |
| 95-th percentile | 4017198.3 |
| Maximum | 4189794 |
| Range | 4189786 |
| Interquartile range (IQR) | 2115005.5 |
Descriptive statistics
| Standard deviation | 1220516.961 |
|---|---|
| Coefficient of variation (CV) | 0.5801463642 |
| Kurtosis | -1.180530781 |
| Mean | 2103808.687 |
| Median Absolute Deviation (MAD) | 1041859 |
| Skewness | 0.04417338718 |
| Sum | 2.217309166e+11 |
| Variance | 1.489661651e+12 |
| Monotocity | Strictly increasing |
| Value | Count | Frequency (%) | |
| 1574913 | 1 | < 0.1% | |
| 4025976 | 1 | < 0.1% | |
| 2492053 | 1 | < 0.1% | |
| 519154 | 1 | < 0.1% | |
| 2027144 | 1 | < 0.1% | |
| 4103814 | 1 | < 0.1% | |
| 862960 | 1 | < 0.1% | |
| 179842 | 1 | < 0.1% | |
| 88419 | 1 | < 0.1% | |
| 3497598 | 1 | < 0.1% | |
| 95867 | 1 | < 0.1% | |
| 3850314 | 1 | < 0.1% | |
| 146125 | 1 | < 0.1% | |
| 673437 | 1 | < 0.1% | |
| 3751538 | 1 | < 0.1% | |
| 2436720 | 1 | < 0.1% | |
| 4158914 | 1 | < 0.1% | |
| 2739820 | 1 | < 0.1% | |
| 2485864 | 1 | < 0.1% | |
| 3245669 | 1 | < 0.1% | |
| 2872288 | 1 | < 0.1% | |
| 4042336 | 1 | < 0.1% | |
| 2643551 | 1 | < 0.1% | |
| 3956318 | 1 | < 0.1% | |
| 2248348 | 1 | < 0.1% | |
| Other values (105370) | 105370 | > 99.9% |
| Value | Count | Frequency (%) | |
| 8 | 1 | < 0.1% | |
| 54 | 1 | < 0.1% | |
| 170 | 1 | < 0.1% | |
| 173 | 1 | < 0.1% | |
| 202 | 1 | < 0.1% | |
| 252 | 1 | < 0.1% | |
| 372 | 1 | < 0.1% | |
| 388 | 1 | < 0.1% | |
| 461 | 1 | < 0.1% | |
| 496 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 4189794 | 1 | < 0.1% | |
| 4189741 | 1 | < 0.1% | |
| 4189635 | 1 | < 0.1% | |
| 4189581 | 1 | < 0.1% | |
| 4189566 | 1 | < 0.1% | |
| 4189530 | 1 | < 0.1% | |
| 4189514 | 1 | < 0.1% | |
| 4189494 | 1 | < 0.1% | |
| 4189484 | 1 | < 0.1% | |
| 4189436 | 1 | < 0.1% |
length
Real number (ℝ≥0)
| Distinct | 46 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.011785853 |
|---|---|
| Minimum | 0.78 |
| Maximum | 2.13 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 823.5 KiB |
Quantile statistics
| Minimum | 0.78 |
|---|---|
| 5-th percentile | 0.87 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1.32 |
| Maximum | 2.13 |
| Range | 1.35 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.1536514365 |
|---|---|
| Coefficient of variation (CV) | 0.1518616178 |
| Kurtosis | 14.46997813 |
| Mean | 1.011785853 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.401341235 |
| Sum | 106637.17 |
| Variance | 0.02360876394 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1 | 66095 | 62.7% | |
| 0.87 | 16780 | 15.9% | |
| 1.06 | 9177 | 8.7% | |
| 0.85 | 3259 | 3.1% | |
| 1.22 | 1354 | 1.3% | |
| 1.36 | 1285 | 1.2% | |
| 1.85 | 764 | 0.7% | |
| 1.82 | 718 | 0.7% | |
| 1.41 | 712 | 0.7% | |
| 1.01 | 684 | 0.6% | |
| 0.86 | 616 | 0.6% | |
| 1.11 | 493 | 0.5% | |
| 1.32 | 386 | 0.4% | |
| 0.83 | 355 | 0.3% | |
| 1.27 | 341 | 0.3% | |
| 1.6 | 341 | 0.3% | |
| 1.67 | 333 | 0.3% | |
| 1.03 | 302 | 0.3% | |
| 1.35 | 281 | 0.3% | |
| 1.05 | 228 | 0.2% | |
| 1.83 | 219 | 0.2% | |
| 0.79 | 132 | 0.1% | |
| 1.73 | 116 | 0.1% | |
| 0.9 | 103 | 0.1% | |
| 1.71 | 79 | 0.1% | |
| Other values (21) | 242 | 0.2% |
| Value | Count | Frequency (%) | |
| 0.78 | 4 | < 0.1% | |
| 0.79 | 132 | 0.1% | |
| 0.82 | 1 | < 0.1% | |
| 0.83 | 355 | 0.3% | |
| 0.85 | 3259 | 3.1% | |
| 0.86 | 616 | 0.6% | |
| 0.87 | 16780 | 15.9% | |
| 0.9 | 103 | 0.1% | |
| 0.92 | 2 | < 0.1% | |
| 0.99 | 6 | < 0.1% |
| Value | Count | Frequency (%) | |
| 2.13 | 2 | < 0.1% | |
| 1.92 | 13 | < 0.1% | |
| 1.89 | 12 | < 0.1% | |
| 1.85 | 764 | 0.7% | |
| 1.84 | 2 | < 0.1% | |
| 1.83 | 219 | 0.2% | |
| 1.82 | 718 | 0.7% | |
| 1.77 | 17 | < 0.1% | |
| 1.75 | 8 | < 0.1% | |
| 1.73 | 116 | 0.1% |
car_number
Real number (ℝ≥0)
| Distinct | 83016 |
|---|---|
| Distinct (%) | 78.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 60277175.58 |
|---|---|
| Minimum | 24051609 |
| Maximum | 98098866 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 823.5 KiB |
Quantile statistics
| Minimum | 24051609 |
|---|---|
| 5-th percentile | 50064872 |
| Q1 | 53717187 |
| median | 57283392 |
| Q3 | 62878905 |
| 95-th percentile | 93939820.7 |
| Maximum | 98098866 |
| Range | 74047257 |
| Interquartile range (IQR) | 9161718 |
Descriptive statistics
| Standard deviation | 12788218.5 |
|---|---|
| Coefficient of variation (CV) | 0.2121568964 |
| Kurtosis | 2.476725353 |
| Mean | 60277175.58 |
| Median Absolute Deviation (MAD) | 4529067 |
| Skewness | 1.183226314 |
| Sum | 6.35291292e+12 |
| Variance | 1.635385323e+14 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 55864821 | 24 | < 0.1% | |
| 55927537 | 23 | < 0.1% | |
| 55822944 | 22 | < 0.1% | |
| 55701130 | 22 | < 0.1% | |
| 55626428 | 22 | < 0.1% | |
| 55822928 | 22 | < 0.1% | |
| 55864227 | 21 | < 0.1% | |
| 32020406 | 20 | < 0.1% | |
| 55997936 | 18 | < 0.1% | |
| 55864862 | 17 | < 0.1% | |
| 55864714 | 17 | < 0.1% | |
| 34164483 | 17 | < 0.1% | |
| 34164475 | 17 | < 0.1% | |
| 55851810 | 17 | < 0.1% | |
| 55954358 | 17 | < 0.1% | |
| 55954481 | 17 | < 0.1% | |
| 34161414 | 17 | < 0.1% | |
| 55924526 | 17 | < 0.1% | |
| 55952550 | 17 | < 0.1% | |
| 55864466 | 17 | < 0.1% | |
| 34165936 | 17 | < 0.1% | |
| 32020455 | 17 | < 0.1% | |
| 34155770 | 17 | < 0.1% | |
| 55750186 | 17 | < 0.1% | |
| 32020257 | 17 | < 0.1% | |
| Other values (82991) | 104929 | 99.6% |
| Value | Count | Frequency (%) | |
| 24051609 | 1 | < 0.1% | |
| 24077588 | 1 | < 0.1% | |
| 24173940 | 1 | < 0.1% | |
| 24187924 | 1 | < 0.1% | |
| 24197014 | 1 | < 0.1% | |
| 24198012 | 1 | < 0.1% | |
| 24220931 | 1 | < 0.1% | |
| 24269250 | 1 | < 0.1% | |
| 24286957 | 1 | < 0.1% | |
| 24306508 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 98098866 | 1 | < 0.1% | |
| 98098742 | 1 | < 0.1% | |
| 98098700 | 2 | < 0.1% | |
| 98098429 | 1 | < 0.1% | |
| 98098346 | 2 | < 0.1% | |
| 98098296 | 1 | < 0.1% | |
| 98098221 | 1 | < 0.1% | |
| 98098213 | 1 | < 0.1% | |
| 98098171 | 1 | < 0.1% | |
| 98098114 | 1 | < 0.1% |
destination_esr
Real number (ℝ≥0)
| Distinct | 1130 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 624 |
| Missing (%) | 0.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 855538.1576 |
|---|---|
| Minimum | 10002 |
| Maximum | 998100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 823.5 KiB |
Quantile statistics
| Minimum | 10002 |
|---|---|
| 5-th percentile | 249302 |
| Q1 | 852002 |
| median | 925701 |
| Q3 | 970001 |
| 95-th percentile | 989309 |
| Maximum | 998100 |
| Range | 988098 |
| Interquartile range (IQR) | 117999 |
Descriptive statistics
| Standard deviation | 206440.2828 |
|---|---|
| Coefficient of variation (CV) | 0.2412987439 |
| Kurtosis | 7.188083939 |
| Mean | 855538.1576 |
| Median Absolute Deviation (MAD) | 57813 |
| Skewness | -2.755956288 |
| Sum | 8.963558832e+10 |
| Variance | 4.261759035e+10 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 947005 | 5488 | 5.2% | |
| 989309 | 5414 | 5.1% | |
| 883809 | 4293 | 4.1% | |
| 967808 | 3473 | 3.3% | |
| 925701 | 3053 | 2.9% | |
| 852002 | 2871 | 2.7% | |
| 985505 | 2558 | 2.4% | |
| 801208 | 2540 | 2.4% | |
| 862305 | 2201 | 2.1% | |
| 987801 | 2157 | 2.0% | |
| 986103 | 1999 | 1.9% | |
| 985702 | 1977 | 1.9% | |
| 944007 | 1895 | 1.8% | |
| 840109 | 1723 | 1.6% | |
| 887800 | 1648 | 1.6% | |
| 983514 | 1529 | 1.5% | |
| 521001 | 1456 | 1.4% | |
| 817600 | 1413 | 1.3% | |
| 954704 | 1362 | 1.3% | |
| 864207 | 1347 | 1.3% | |
| 76404 | 1272 | 1.2% | |
| 942105 | 1257 | 1.2% | |
| 891806 | 1228 | 1.2% | |
| 984700 | 1112 | 1.1% | |
| 892103 | 1100 | 1.0% | |
| Other values (1105) | 48405 | 45.9% |
| Value | Count | Frequency (%) | |
| 10002 | 2 | < 0.1% | |
| 10303 | 2 | < 0.1% | |
| 11804 | 3 | < 0.1% | |
| 14906 | 2 | < 0.1% | |
| 15400 | 1 | < 0.1% | |
| 15805 | 11 | < 0.1% | |
| 16403 | 1 | < 0.1% | |
| 17001 | 4 | < 0.1% | |
| 17904 | 5 | < 0.1% | |
| 18409 | 14 | < 0.1% |
| Value | Count | Frequency (%) | |
| 998100 | 73 | 0.1% | |
| 996904 | 1 | < 0.1% | |
| 996302 | 8 | < 0.1% | |
| 993304 | 142 | 0.1% | |
| 993107 | 29 | < 0.1% | |
| 991205 | 3 | < 0.1% | |
| 991101 | 54 | 0.1% | |
| 990700 | 8 | < 0.1% | |
| 990607 | 4 | < 0.1% | |
| 990005 | 14 | < 0.1% |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.11079273 |
|---|---|
| Minimum | 20 |
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 823.5 KiB |
Quantile statistics
| Minimum | 20 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 20 |
| median | 20 |
| Q3 | 20 |
| 95-th percentile | 20 |
| Maximum | 99 |
| Range | 79 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.519491096 |
|---|---|
| Coefficient of variation (CV) | 0.07555600202 |
| Kurtosis | 2069.027137 |
| Mean | 20.11079273 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 40.75622229 |
| Sum | 2119577 |
| Variance | 2.308853192 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 20 | 103764 | 98.5% | |
| 26 | 1138 | 1.1% | |
| 27 | 325 | 0.3% | |
| 21 | 117 | 0.1% | |
| 99 | 30 | < 0.1% | |
| 25 | 13 | < 0.1% | |
| 22 | 5 | < 0.1% | |
| 24 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 20 | 103764 | 98.5% | |
| 21 | 117 | 0.1% | |
| 22 | 5 | < 0.1% | |
| 24 | 3 | < 0.1% | |
| 25 | 13 | < 0.1% | |
| 26 | 1138 | 1.1% | |
| 27 | 325 | 0.3% | |
| 99 | 30 | < 0.1% |
| Value | Count | Frequency (%) | |
| 99 | 30 | < 0.1% | |
| 27 | 325 | 0.3% | |
| 26 | 1138 | 1.1% | |
| 25 | 13 | < 0.1% | |
| 24 | 3 | < 0.1% | |
| 22 | 5 | < 0.1% | |
| 21 | 117 | 0.1% | |
| 20 | 103764 | 98.5% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 88907 |
| Missing (%) | 84.4% |
| Memory size | 823.5 KiB |
| 1 | |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 1 | 16488 | 15.6% | |
| (Missing) | 88907 | 84.4% |
gruz
Real number (ℝ≥0)
| Distinct | 360 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 7 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 190397.6961 |
|---|---|
| Minimum | 3009 |
| Maximum | 999993 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 823.5 KiB |
Quantile statistics
| Minimum | 3009 |
|---|---|
| 5-th percentile | 81188 |
| Q1 | 151446 |
| median | 161096 |
| Q3 | 221136 |
| 95-th percentile | 331016 |
| Maximum | 999993 |
| Range | 996984 |
| Interquartile range (IQR) | 69690 |
Descriptive statistics
| Standard deviation | 96117.23825 |
|---|---|
| Coefficient of variation (CV) | 0.5048235363 |
| Kurtosis | 6.842201859 |
| Mean | 190397.6961 |
| Median Absolute Deviation (MAD) | 20004 |
| Skewness | 1.889233573 |
| Sum | 2.00656324e+10 |
| Variance | 9238523489 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 161096 | 18138 | 17.2% | |
| 141092 | 7732 | 7.3% | |
| 161043 | 5151 | 4.9% | |
| 91118 | 4573 | 4.3% | |
| 161113 | 4369 | 4.1% | |
| 161132 | 3566 | 3.4% | |
| 214039 | 3465 | 3.3% | |
| 161128 | 3154 | 3.0% | |
| 211056 | 2995 | 2.8% | |
| 141162 | 2905 | 2.8% | |
| 221136 | 2867 | 2.7% | |
| 161016 | 2774 | 2.6% | |
| 151446 | 2772 | 2.6% | |
| 281048 | 2492 | 2.4% | |
| 81188 | 2476 | 2.3% | |
| 314059 | 2296 | 2.2% | |
| 3009 | 2231 | 2.1% | |
| 221066 | 2042 | 1.9% | |
| 391498 | 1696 | 1.6% | |
| 324116 | 1653 | 1.6% | |
| 161062 | 1424 | 1.4% | |
| 236038 | 1152 | 1.1% | |
| 232431 | 1064 | 1.0% | |
| 331016 | 1011 | 1.0% | |
| 316073 | 912 | 0.9% | |
| Other values (335) | 20478 | 19.4% |
| Value | Count | Frequency (%) | |
| 3009 | 2231 | 2.1% | |
| 11005 | 867 | 0.8% | |
| 12008 | 3 | < 0.1% | |
| 13000 | 31 | < 0.1% | |
| 14003 | 102 | 0.1% | |
| 15006 | 3 | < 0.1% | |
| 18019 | 13 | < 0.1% | |
| 18023 | 411 | 0.4% | |
| 18108 | 8 | < 0.1% | |
| 21079 | 5 | < 0.1% |
| Value | Count | Frequency (%) | |
| 999993 | 6 | < 0.1% | |
| 757325 | 1 | < 0.1% | |
| 756063 | 1 | < 0.1% | |
| 731062 | 3 | < 0.1% | |
| 725502 | 16 | < 0.1% | |
| 721041 | 1 | < 0.1% | |
| 711317 | 8 | < 0.1% | |
| 711285 | 46 | < 0.1% | |
| 711266 | 20 | < 0.1% | |
| 711035 | 79 | 0.1% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 823.5 KiB |
| 11 |
|---|
| Value | Count | Frequency (%) | |
| 11 | 105395 | 100.0% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 1 | 210790 | 50.0% | |
| . | 105395 | 25.0% | |
| 0 | 105395 | 25.0% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 316185 | 75.0% | |
| Other Punctuation | 105395 | 25.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 210790 | 66.7% | |
| 0 | 105395 | 33.3% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| . | 105395 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 421580 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1 | 210790 | 50.0% | |
| . | 105395 | 25.0% | |
| 0 | 105395 | 25.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 421580 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 1 | 210790 | 50.0% | |
| . | 105395 | 25.0% | |
| 0 | 105395 | 25.0% |
| Distinct | 17065 |
|---|---|
| Distinct (%) | 16.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 823.5 KiB |
| 2020-07-18 15:10:00 | 156 |
|---|---|
| 2020-07-10 16:14:00 | 139 |
| 2020-07-15 17:34:00 | 129 |
| 2020-07-26 12:57:00 | 128 |
| 2020-07-16 16:31:00 | 118 |
| Other values (17060) |
| Value | Count | Frequency (%) | |
| 2020-07-18 15:10:00 | 156 | 0.1% | |
| 2020-07-10 16:14:00 | 139 | 0.1% | |
| 2020-07-15 17:34:00 | 129 | 0.1% | |
| 2020-07-26 12:57:00 | 128 | 0.1% | |
| 2020-07-16 16:31:00 | 118 | 0.1% | |
| 2020-07-17 17:21:00 | 115 | 0.1% | |
| 2020-07-09 17:22:00 | 112 | 0.1% | |
| 2020-07-27 11:13:00 | 112 | 0.1% | |
| 2020-07-17 14:26:00 | 112 | 0.1% | |
| 2020-07-11 17:08:00 | 109 | 0.1% | |
| 2020-07-17 16:22:00 | 109 | 0.1% | |
| 2020-07-23 16:53:00 | 105 | 0.1% | |
| 2020-07-23 13:09:00 | 105 | 0.1% | |
| 2020-07-16 14:46:00 | 102 | 0.1% | |
| 2020-07-31 18:55:00 | 101 | 0.1% | |
| 2020-07-20 17:55:00 | 100 | 0.1% | |
| 2020-07-10 16:52:00 | 97 | 0.1% | |
| 2020-07-18 10:48:00 | 97 | 0.1% | |
| 2020-07-20 16:23:00 | 97 | 0.1% | |
| 2020-07-27 17:08:00 | 95 | 0.1% | |
| 2020-07-27 16:53:00 | 95 | 0.1% | |
| 2020-07-25 15:39:00 | 90 | 0.1% | |
| 2020-07-16 01:35:00 | 89 | 0.1% | |
| 2020-07-14 14:39:00 | 88 | 0.1% | |
| 2020-07-11 15:58:00 | 88 | 0.1% | |
| Other values (17040) | 102707 | 97.4% |
Unique
| Unique | 7225 ? |
|---|---|
| Unique (%) | 6.9% |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 611098 | 30.5% | |
| 2 | 310097 | 15.5% | |
| - | 210790 | 10.5% | |
| : | 210790 | 10.5% | |
| 1 | 156135 | 7.8% | |
| 7 | 146212 | 7.3% | |
| 105395 | 5.3% | ||
| 5 | 53876 | 2.7% | |
| 3 | 53454 | 2.7% | |
| 4 | 44455 | 2.2% | |
| 6 | 40184 | 2.0% | |
| 9 | 30940 | 1.5% | |
| 8 | 29079 | 1.5% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 1475530 | 73.7% | |
| Dash Punctuation | 210790 | 10.5% | |
| Other Punctuation | 210790 | 10.5% | |
| Space Separator | 105395 | 5.3% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 611098 | 41.4% | |
| 2 | 310097 | 21.0% | |
| 1 | 156135 | 10.6% | |
| 7 | 146212 | 9.9% | |
| 5 | 53876 | 3.7% | |
| 3 | 53454 | 3.6% | |
| 4 | 44455 | 3.0% | |
| 6 | 40184 | 2.7% | |
| 9 | 30940 | 2.1% | |
| 8 | 29079 | 2.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 210790 | 100.0% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 105395 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| : | 210790 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 2002505 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 611098 | 30.5% | |
| 2 | 310097 | 15.5% | |
| - | 210790 | 10.5% | |
| : | 210790 | 10.5% | |
| 1 | 156135 | 7.8% | |
| 7 | 146212 | 7.3% | |
| 105395 | 5.3% | ||
| 5 | 53876 | 2.7% | |
| 3 | 53454 | 2.7% | |
| 4 | 44455 | 2.2% | |
| 6 | 40184 | 2.0% | |
| 9 | 30940 | 1.5% | |
| 8 | 29079 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 2002505 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 611098 | 30.5% | |
| 2 | 310097 | 15.5% | |
| - | 210790 | 10.5% | |
| : | 210790 | 10.5% | |
| 1 | 156135 | 7.8% | |
| 7 | 146212 | 7.3% | |
| 105395 | 5.3% | ||
| 5 | 53876 | 2.7% | |
| 3 | 53454 | 2.7% | |
| 4 | 44455 | 2.2% | |
| 6 | 40184 | 2.0% | |
| 9 | 30940 | 1.5% | |
| 8 | 29079 | 1.5% |
operation_st_esr
Real number (ℝ≥0)
| Distinct | 386 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 10 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 905907.0251 |
|---|---|
| Minimum | 830107 |
| Maximum | 998100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 823.5 KiB |
Quantile statistics
| Minimum | 830107 |
|---|---|
| 5-th percentile | 841703 |
| Q1 | 881408 |
| median | 894109 |
| Q3 | 932705 |
| 95-th percentile | 971502 |
| Maximum | 998100 |
| Range | 167993 |
| Interquartile range (IQR) | 51297 |
Descriptive statistics
| Standard deviation | 39481.62349 |
|---|---|
| Coefficient of variation (CV) | 0.04358242336 |
| Kurtosis | -0.805312884 |
| Mean | 905907.0251 |
| Median Absolute Deviation (MAD) | 29902 |
| Skewness | 0.1184575704 |
| Sum | 9.546901184e+10 |
| Variance | 1558798593 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 893106 | 11227 | 10.7% | |
| 864207 | 5203 | 4.9% | |
| 881408 | 5161 | 4.9% | |
| 831504 | 3818 | 3.6% | |
| 913206 | 3594 | 3.4% | |
| 917207 | 3355 | 3.2% | |
| 911605 | 3322 | 3.2% | |
| 968707 | 3298 | 3.1% | |
| 926206 | 2906 | 2.8% | |
| 961604 | 2867 | 2.7% | |
| 852801 | 2779 | 2.6% | |
| 884906 | 2774 | 2.6% | |
| 944702 | 2368 | 2.2% | |
| 861302 | 2146 | 2.0% | |
| 925701 | 2088 | 2.0% | |
| 944609 | 1710 | 1.6% | |
| 887904 | 1710 | 1.6% | |
| 862305 | 1403 | 1.3% | |
| 974407 | 1371 | 1.3% | |
| 955406 | 1261 | 1.2% | |
| 882506 | 1249 | 1.2% | |
| 960103 | 1115 | 1.1% | |
| 887603 | 1087 | 1.0% | |
| 941704 | 950 | 0.9% | |
| 883809 | 858 | 0.8% | |
| Other values (361) | 35765 | 33.9% |
| Value | Count | Frequency (%) | |
| 830107 | 77 | 0.1% | |
| 830200 | 28 | < 0.1% | |
| 830304 | 124 | 0.1% | |
| 830709 | 285 | 0.3% | |
| 831203 | 22 | < 0.1% | |
| 831400 | 75 | 0.1% | |
| 831504 | 3818 | 3.6% | |
| 831608 | 25 | < 0.1% | |
| 831805 | 9 | < 0.1% | |
| 832009 | 12 | < 0.1% |
| Value | Count | Frequency (%) | |
| 998100 | 7 | < 0.1% | |
| 996904 | 80 | 0.1% | |
| 990005 | 39 | < 0.1% | |
| 988908 | 353 | 0.3% | |
| 988306 | 28 | < 0.1% | |
| 988109 | 1 | < 0.1% | |
| 987905 | 2 | < 0.1% | |
| 987303 | 48 | < 0.1% | |
| 985308 | 31 | < 0.1% | |
| 984502 | 405 | 0.4% |
operation_st_id
Real number (ℝ≥0)
| Distinct | 386 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 10 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2000463875 |
|---|---|
| Minimum | 2000035090 |
| Maximum | 2002025611 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 823.5 KiB |
Quantile statistics
| Minimum | 2000035090 |
|---|---|
| 5-th percentile | 2000035194 |
| Q1 | 2000035966 |
| median | 2000037038 |
| Q3 | 2000038832 |
| 95-th percentile | 2001933226 |
| Maximum | 2002025611 |
| Range | 1990521 |
| Interquartile range (IQR) | 2866 |
Descriptive statistics
| Standard deviation | 791682.9162 |
|---|---|
| Coefficient of variation (CV) | 0.0003957496689 |
| Kurtosis | -0.2725703222 |
| Mean | 2000463875 |
| Median Absolute Deviation (MAD) | 1354 |
| Skewness | 1.314307336 |
| Sum | 2.108188855e+14 |
| Variance | 6.267618399e+11 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 2000035966 | 11227 | 10.7% | |
| 2001930816 | 5203 | 4.9% | |
| 2000035194 | 5161 | 4.9% | |
| 2001930534 | 3818 | 3.6% | |
| 2000036356 | 3594 | 3.4% | |
| 2000036424 | 3355 | 3.2% | |
| 2000036342 | 3322 | 3.2% | |
| 2000038620 | 3298 | 3.1% | |
| 2000036888 | 2906 | 2.8% | |
| 2000038410 | 2867 | 2.7% | |
| 2001933476 | 2779 | 2.6% | |
| 2000035324 | 2774 | 2.6% | |
| 2000037816 | 2368 | 2.2% | |
| 2001930770 | 2146 | 2.0% | |
| 2000036868 | 2088 | 2.0% | |
| 2000035564 | 1710 | 1.6% | |
| 2000037808 | 1710 | 1.6% | |
| 2001930778 | 1403 | 1.3% | |
| 2000038762 | 1371 | 1.3% | |
| 2000038302 | 1261 | 1.2% | |
| 2000035232 | 1249 | 1.2% | |
| 2000038372 | 1115 | 1.1% | |
| 2000035530 | 1087 | 1.0% | |
| 2000037662 | 950 | 0.9% | |
| 2000035252 | 858 | 0.8% | |
| Other values (361) | 35765 | 33.9% |
| Value | Count | Frequency (%) | |
| 2000035090 | 4 | < 0.1% | |
| 2000035110 | 111 | 0.1% | |
| 2000035130 | 25 | < 0.1% | |
| 2000035140 | 57 | 0.1% | |
| 2000035162 | 4 | < 0.1% | |
| 2000035182 | 5 | < 0.1% | |
| 2000035194 | 5161 | 4.9% | |
| 2000035212 | 9 | < 0.1% | |
| 2000035218 | 9 | < 0.1% | |
| 2000035222 | 33 | < 0.1% |
| Value | Count | Frequency (%) | |
| 2002025611 | 8 | < 0.1% | |
| 2002025609 | 13 | < 0.1% | |
| 2002023503 | 1 | < 0.1% | |
| 2001933538 | 476 | 0.5% | |
| 2001933530 | 12 | < 0.1% | |
| 2001933522 | 65 | 0.1% | |
| 2001933502 | 135 | 0.1% | |
| 2001933498 | 1 | < 0.1% | |
| 2001933494 | 71 | 0.1% | |
| 2001933484 | 628 | 0.6% |
| Distinct | 1918 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 7 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27951350.75 |
|---|---|
| Minimum | 0 |
| Maximum | 99803052 |
| Zeros | 16781 |
| Zeros (%) | 15.9% |
| Memory size | 823.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 186507 |
| median | 5785164 |
| Q3 | 61946814 |
| 95-th percentile | 95266723 |
| Maximum | 99803052 |
| Range | 99803052 |
| Interquartile range (IQR) | 61760307 |
Descriptive statistics
| Standard deviation | 34431644.65 |
|---|---|
| Coefficient of variation (CV) | 1.231841887 |
| Kurtosis | -0.9417545309 |
| Mean | 27951350.75 |
| Median Absolute Deviation (MAD) | 5785164 |
| Skewness | 0.8312485999 |
| Sum | 2.945736953e+12 |
| Variance | 1.185538154e+15 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 16781 | 15.9% | |
| 5785164 | 4215 | 4.0% | |
| 73116035 | 3409 | 3.2% | |
| 186720 | 2919 | 2.8% | |
| 79601286 | 2643 | 2.5% | |
| 186465 | 2536 | 2.4% | |
| 5757676 | 2251 | 2.1% | |
| 71479207 | 2183 | 2.1% | |
| 1126648 | 2157 | 2.0% | |
| 160206 | 2143 | 2.0% | |
| 12627615 | 1895 | 1.8% | |
| 95266723 | 1640 | 1.6% | |
| 1126631 | 1603 | 1.5% | |
| 20770562 | 1529 | 1.5% | |
| 97059520 | 1487 | 1.4% | |
| 186424 | 1403 | 1.3% | |
| 4622690 | 1370 | 1.3% | |
| 1126163 | 1323 | 1.3% | |
| 461379 | 1260 | 1.2% | |
| 74421763 | 1176 | 1.1% | |
| 105457 | 1176 | 1.1% | |
| 47859907 | 1163 | 1.1% | |
| 1126022 | 1094 | 1.0% | |
| 1126016 | 1049 | 1.0% | |
| 94737356 | 1034 | 1.0% | |
| Other values (1893) | 43949 | 41.7% |
| Value | Count | Frequency (%) | |
| 0 | 16781 | 15.9% | |
| 18595 | 114 | 0.1% | |
| 83262 | 3 | < 0.1% | |
| 105182 | 1 | < 0.1% | |
| 105199 | 124 | 0.1% | |
| 105207 | 130 | 0.1% | |
| 105213 | 308 | 0.3% | |
| 105236 | 61 | 0.1% | |
| 105414 | 1 | < 0.1% | |
| 105457 | 1176 | 1.1% |
| Value | Count | Frequency (%) | |
| 99803052 | 1 | < 0.1% | |
| 99769585 | 1 | < 0.1% | |
| 99426230 | 1 | < 0.1% | |
| 99332842 | 9 | < 0.1% | |
| 99294283 | 3 | < 0.1% | |
| 99029960 | 2 | < 0.1% | |
| 98891999 | 4 | < 0.1% | |
| 98780490 | 1 | < 0.1% | |
| 98768158 | 4 | < 0.1% | |
| 98754452 | 34 | < 0.1% |
rodvag
Real number (ℝ≥0)
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 64.0273068 |
|---|---|
| Minimum | 20 |
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 823.5 KiB |
Quantile statistics
| Minimum | 20 |
|---|---|
| 5-th percentile | 40 |
| Q1 | 60 |
| median | 60 |
| Q3 | 70 |
| 95-th percentile | 95 |
| Maximum | 99 |
| Range | 79 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 15.28810658 |
|---|---|
| Coefficient of variation (CV) | 0.2387747875 |
| Kurtosis | 1.856866053 |
| Mean | 64.0273068 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.1499185088 |
| Sum | 6748158 |
| Variance | 233.7262027 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 60 | 66241 | 62.9% | |
| 70 | 15608 | 14.8% | |
| 90 | 8799 | 8.3% | |
| 20 | 4348 | 4.1% | |
| 96 | 3920 | 3.7% | |
| 40 | 3350 | 3.2% | |
| 95 | 1447 | 1.4% | |
| 93 | 1005 | 1.0% | |
| 92 | 419 | 0.4% | |
| 87 | 256 | 0.2% | |
| 99 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 20 | 4348 | 4.1% | |
| 40 | 3350 | 3.2% | |
| 60 | 66241 | 62.9% | |
| 70 | 15608 | 14.8% | |
| 87 | 256 | 0.2% | |
| 90 | 8799 | 8.3% | |
| 92 | 419 | 0.4% | |
| 93 | 1005 | 1.0% | |
| 95 | 1447 | 1.4% | |
| 96 | 3920 | 3.7% |
| Value | Count | Frequency (%) | |
| 99 | 2 | < 0.1% | |
| 96 | 3920 | 3.7% | |
| 95 | 1447 | 1.4% | |
| 93 | 1005 | 1.0% | |
| 92 | 419 | 0.4% | |
| 90 | 8799 | 8.3% | |
| 87 | 256 | 0.2% | |
| 70 | 15608 | 14.8% | |
| 60 | 66241 | 62.9% | |
| 40 | 3350 | 3.2% |
sender
Real number (ℝ≥0)
| Distinct | 933 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 7 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38581587.72 |
|---|---|
| Minimum | 0 |
| Maximum | 99863723 |
| Zeros | 355 |
| Zeros (%) | 0.3% |
| Memory size | 823.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 161246 |
| Q1 | 4788410 |
| median | 36836748 |
| Q3 | 74877457 |
| 95-th percentile | 94535486 |
| Maximum | 99863723 |
| Range | 99863723 |
| Interquartile range (IQR) | 70089047 |
Descriptive statistics
| Standard deviation | 34388017.35 |
|---|---|
| Coefficient of variation (CV) | 0.8913064336 |
| Kurtosis | -1.490371587 |
| Mean | 38581587.72 |
| Median Absolute Deviation (MAD) | 35777730 |
| Skewness | 0.2494814406 |
| Sum | 4.066036367e+12 |
| Variance | 1.182535738e+15 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 81213597 | 11195 | 10.6% | |
| 48134187 | 9310 | 8.8% | |
| 5757676 | 5511 | 5.2% | |
| 186720 | 4992 | 4.7% | |
| 98770511 | 3745 | 3.6% | |
| 13141274 | 3560 | 3.4% | |
| 160206 | 3544 | 3.4% | |
| 5785164 | 3420 | 3.2% | |
| 161246 | 3353 | 3.2% | |
| 161878 | 3321 | 3.2% | |
| 73844898 | 2856 | 2.7% | |
| 55472826 | 2839 | 2.7% | |
| 75533872 | 2368 | 2.2% | |
| 164517 | 1710 | 1.6% | |
| 57615980 | 1577 | 1.5% | |
| 282754 | 1379 | 1.3% | |
| 26635687 | 1133 | 1.1% | |
| 3434207 | 1112 | 1.1% | |
| 19053140 | 1080 | 1.0% | |
| 49216289 | 994 | 0.9% | |
| 78465421 | 991 | 0.9% | |
| 58734994 | 950 | 0.9% | |
| 7621060 | 784 | 0.7% | |
| 53086734 | 674 | 0.6% | |
| 44474 | 668 | 0.6% | |
| Other values (908) | 32322 | 30.7% |
| Value | Count | Frequency (%) | |
| 0 | 355 | 0.3% | |
| 44474 | 668 | 0.6% | |
| 83262 | 3 | < 0.1% | |
| 105236 | 1 | < 0.1% | |
| 108708 | 14 | < 0.1% | |
| 109783 | 1 | < 0.1% | |
| 160028 | 3 | < 0.1% | |
| 160206 | 3544 | 3.4% | |
| 160212 | 25 | < 0.1% | |
| 161186 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 99863723 | 10 | < 0.1% | |
| 99435157 | 17 | < 0.1% | |
| 99417515 | 1 | < 0.1% | |
| 99415491 | 194 | 0.2% | |
| 98770511 | 3745 | 3.6% | |
| 98102991 | 440 | 0.4% | |
| 97720043 | 26 | < 0.1% | |
| 97717058 | 8 | < 0.1% | |
| 97689014 | 3 | < 0.1% | |
| 97679381 | 260 | 0.2% |
tare_weight
Real number (ℝ≥0)
| Distinct | 214 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 244.4924807 |
|---|---|
| Minimum | 178 |
| Maximum | 590 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 823.5 KiB |
Quantile statistics
| Minimum | 178 |
|---|---|
| 5-th percentile | 223 |
| Q1 | 235 |
| median | 240 |
| Q3 | 249 |
| 95-th percentile | 272 |
| Maximum | 590 |
| Range | 412 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 22.01931227 |
|---|---|
| Coefficient of variation (CV) | 0.09006130663 |
| Kurtosis | 70.08852525 |
| Mean | 244.4924807 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 5.398951614 |
| Sum | 25768285 |
| Variance | 484.8501128 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 240 | 11448 | 10.9% | |
| 245 | 8207 | 7.8% | |
| 235 | 6155 | 5.8% | |
| 238 | 3600 | 3.4% | |
| 233 | 3392 | 3.2% | |
| 243 | 3365 | 3.2% | |
| 236 | 3302 | 3.1% | |
| 247 | 3128 | 3.0% | |
| 225 | 3062 | 2.9% | |
| 237 | 2706 | 2.6% | |
| 234 | 2578 | 2.4% | |
| 242 | 2480 | 2.4% | |
| 241 | 2469 | 2.3% | |
| 239 | 2264 | 2.1% | |
| 250 | 2196 | 2.1% | |
| 248 | 2193 | 2.1% | |
| 260 | 2086 | 2.0% | |
| 244 | 1985 | 1.9% | |
| 230 | 1905 | 1.8% | |
| 270 | 1838 | 1.7% | |
| 246 | 1649 | 1.6% | |
| 267 | 1644 | 1.6% | |
| 266 | 1626 | 1.5% | |
| 232 | 1606 | 1.5% | |
| 224 | 1601 | 1.5% | |
| Other values (189) | 26910 | 25.5% |
| Value | Count | Frequency (%) | |
| 178 | 7 | < 0.1% | |
| 179 | 4 | < 0.1% | |
| 180 | 11 | < 0.1% | |
| 181 | 8 | < 0.1% | |
| 182 | 2 | < 0.1% | |
| 183 | 2 | < 0.1% | |
| 184 | 26 | < 0.1% | |
| 185 | 6 | < 0.1% | |
| 186 | 8 | < 0.1% | |
| 187 | 5 | < 0.1% |
| Value | Count | Frequency (%) | |
| 590 | 2 | < 0.1% | |
| 588 | 47 | < 0.1% | |
| 587 | 31 | < 0.1% | |
| 586 | 15 | < 0.1% | |
| 585 | 13 | < 0.1% | |
| 584 | 6 | < 0.1% | |
| 583 | 1 | < 0.1% | |
| 580 | 1 | < 0.1% | |
| 475 | 2 | < 0.1% | |
| 459 | 1 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| df_index | index_train | length | car_number | destination_esr | adm | danger | gruz | loaded | operation_car | operation_date | operation_st_esr | operation_st_id | operation_train | receiver | rodvag | rod_train | sender | ssp_station_esr | ssp_station_id | tare_weight | weight_brutto | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 8 | NaN | 1.0 | 62845623 | 983514.0 | 20.0 | NaN | 161132.0 | NaN | 11.0 | 2020-07-16 16:44:00 | 913206.0 | 2.000036e+09 | NaN | 20770562.0 | 60.0 | NaN | 13141274.0 | NaN | NaN | 245.0 | NaN |
| 1 | 54 | NaN | 1.0 | 62842869 | 840109.0 | 20.0 | NaN | 161096.0 | NaN | 11.0 | 2020-07-16 13:46:00 | 893106.0 | 2.000036e+09 | NaN | 4622690.0 | 60.0 | NaN | 81213597.0 | NaN | NaN | 248.0 | NaN |
| 2 | 170 | NaN | 1.0 | 62832654 | NaN | 20.0 | NaN | 161016.0 | NaN | 11.0 | 2020-07-15 21:19:00 | 852801.0 | 2.001933e+09 | NaN | 97728197.0 | 60.0 | NaN | 55472826.0 | NaN | NaN | 248.0 | NaN |
| 3 | 173 | NaN | 1.0 | 62832258 | 967808.0 | 20.0 | NaN | 161043.0 | NaN | 11.0 | 2020-07-16 13:25:00 | 913206.0 | 2.000036e+09 | NaN | 1126163.0 | 60.0 | NaN | 13141274.0 | NaN | NaN | 245.0 | NaN |
| 4 | 202 | NaN | 1.0 | 62828835 | 985702.0 | 20.0 | NaN | 161016.0 | NaN | 11.0 | 2020-07-16 09:50:00 | 852801.0 | 2.001933e+09 | NaN | 10230304.0 | 60.0 | NaN | 55472826.0 | NaN | NaN | 244.0 | NaN |
| 5 | 252 | NaN | 1.0 | 62852504 | 983514.0 | 20.0 | NaN | 161132.0 | NaN | 11.0 | 2020-07-16 16:44:00 | 913206.0 | 2.000036e+09 | NaN | 20770562.0 | 60.0 | NaN | 13141274.0 | NaN | NaN | 245.0 | NaN |
| 6 | 372 | NaN | 1.0 | 62861976 | 817600.0 | 20.0 | NaN | 161043.0 | NaN | 11.0 | 2020-07-16 05:44:00 | 862305.0 | 2.001931e+09 | NaN | 186424.0 | 60.0 | NaN | 160206.0 | NaN | NaN | 243.0 | NaN |
| 7 | 388 | NaN | 1.0 | 62891726 | NaN | 20.0 | NaN | 161016.0 | NaN | 11.0 | 2020-07-15 21:19:00 | 852801.0 | 2.001933e+09 | NaN | 97728197.0 | 60.0 | NaN | 55472826.0 | NaN | NaN | 249.0 | NaN |
| 8 | 461 | NaN | 1.0 | 62854542 | 967808.0 | 20.0 | NaN | 161043.0 | NaN | 11.0 | 2020-07-16 13:25:00 | 913206.0 | 2.000036e+09 | NaN | 1126163.0 | 60.0 | NaN | 13141274.0 | NaN | NaN | 245.0 | NaN |
| 9 | 496 | NaN | 1.0 | 62848973 | 840109.0 | 20.0 | NaN | 161096.0 | NaN | 11.0 | 2020-07-16 10:40:00 | 893106.0 | 2.000036e+09 | NaN | 4622690.0 | 60.0 | NaN | 81213597.0 | NaN | NaN | 245.0 | NaN |
Last rows
| df_index | index_train | length | car_number | destination_esr | adm | danger | gruz | loaded | operation_car | operation_date | operation_st_esr | operation_st_id | operation_train | receiver | rodvag | rod_train | sender | ssp_station_esr | ssp_station_id | tare_weight | weight_brutto | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 105385 | 4189436 | NaN | 1.0 | 62602651 | 944007.0 | 20.0 | NaN | 161096.0 | NaN | 11.0 | 2020-07-16 15:38:00 | 944609.0 | 2.000038e+09 | NaN | 12627615.0 | 60.0 | NaN | 164517.0 | NaN | NaN | 245.0 | NaN |
| 105386 | 4189484 | NaN | 1.0 | 62598461 | 840109.0 | 20.0 | NaN | 161096.0 | NaN | 11.0 | 2020-07-16 13:46:00 | 893106.0 | 2.000036e+09 | NaN | 4622690.0 | 60.0 | NaN | 81213597.0 | NaN | NaN | 245.0 | NaN |
| 105387 | 4189494 | NaN | 1.0 | 62599253 | 982600.0 | 20.0 | NaN | 161113.0 | NaN | 11.0 | 2020-07-16 17:59:00 | 917207.0 | 2.000036e+09 | NaN | 97059520.0 | 60.0 | NaN | 161246.0 | NaN | NaN | 245.0 | NaN |
| 105388 | 4189514 | NaN | 1.0 | 62597976 | 967808.0 | 20.0 | NaN | 161113.0 | NaN | 11.0 | 2020-07-16 12:22:00 | 917207.0 | 2.000036e+09 | NaN | 71479207.0 | 60.0 | NaN | 161246.0 | NaN | NaN | 245.0 | NaN |
| 105389 | 4189530 | NaN | 1.0 | 62822085 | NaN | 20.0 | NaN | 161016.0 | NaN | 11.0 | 2020-07-15 21:19:00 | 852801.0 | 2.001933e+09 | NaN | 97728197.0 | 60.0 | NaN | 55472826.0 | NaN | NaN | 248.0 | NaN |
| 105390 | 4189566 | NaN | 1.0 | 62804869 | 985702.0 | 20.0 | NaN | 161016.0 | NaN | 11.0 | 2020-07-16 09:50:00 | 852801.0 | 2.001933e+09 | NaN | 10230304.0 | 60.0 | NaN | 55472826.0 | NaN | NaN | 241.0 | NaN |
| 105391 | 4189581 | NaN | 1.0 | 62804729 | 985702.0 | 20.0 | NaN | 161016.0 | NaN | 11.0 | 2020-07-16 09:50:00 | 852801.0 | 2.001933e+09 | NaN | 10230304.0 | 60.0 | NaN | 55472826.0 | NaN | NaN | 245.0 | NaN |
| 105392 | 4189635 | NaN | 1.0 | 62802418 | 840109.0 | 20.0 | NaN | 161096.0 | NaN | 11.0 | 2020-07-16 10:40:00 | 893106.0 | 2.000036e+09 | NaN | 4622690.0 | 60.0 | NaN | 81213597.0 | NaN | NaN | 247.0 | NaN |
| 105393 | 4189741 | NaN | 1.0 | 62823323 | NaN | 20.0 | NaN | 161016.0 | NaN | 11.0 | 2020-07-15 21:19:00 | 852801.0 | 2.001933e+09 | NaN | 97728197.0 | 60.0 | NaN | 55472826.0 | NaN | NaN | 249.0 | NaN |
| 105394 | 4189794 | NaN | 1.0 | 62820030 | 985702.0 | 20.0 | NaN | 161016.0 | NaN | 11.0 | 2020-07-16 06:55:00 | 852801.0 | 2.001933e+09 | NaN | 461379.0 | 60.0 | NaN | 55472826.0 | NaN | NaN | 249.0 | NaN |